Exploiting discourse structure for intelligent paragraph retrieval for why-QA

نویسنده

  • Suzan Verberne
چکیده

Finding answers to why-questions involves finding arguments in texts, rather than the noun phrases that are typical targets for factoid questions. Detecting arguments requires detecting specific rhetorical structures and relations. Therefore, we proposed the use of Rhetorical Structure Theory (RST) as a tool for discovering answer to why-questions in paragraphs that are likely to contain the answer. We evaluated this method using two sets of why-questions: one obtained by elicitation of native speakers and one containing questions that are asked to the online question answering system answers.com. Our procedure was able to find answers to about 60% of the why-questions. We conclude that some relation types have a high predictive power in answer selection, but we also found that many questions require a full paragraph for an answer. Therefore, we need to shift the research emphasis towards passage retrieval. We propose a three-step method for retrieving passages that are likely to contain the answer to a why-question: (1) query creation, (2) document retrieval and (3) paragraph retrieval and ranking. Standard information retrieval models are not suitable for ranking paragraphs as candidate answers. One issue is the small size of the text units that must be ranked. In addition, we need to incorporate information on the presence of RST relations in the language model used for ranking.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Paragraph retrieval for why-question answering Exploiting discourse structure for intelligent paragraph retrieval for why-QA

Finding answers to why-questions involves finding arguments in texts, rather than the noun phrases that are typical targets for factoid questions. Detecting arguments requires detecting specific rhetorical structures and relations. Therefore, we proposed the use of Rhetorical Structure Theory (RST) as a tool for discovering answer to why-questions in paragraphs that are likely to contain the an...

متن کامل

Evaluating Answer Extraction for Why-QA using RST-annotated Wikipedia texts

In this paper the research focus is on the task of answer extraction for why-questions. As opposed to techniques for factoid QA, finding answers to whyquestions involves exploiting text structure. Therefore, we approach the answer extraction problem as a discourse analysis task, using Rhetorical Structure Theory (RST) as framework. We evaluated this method using a set of why-questions that have...

متن کامل

Evaluating paragraph retrieval for why-QA

We implemented a baseline approach to why-question answering based on paragraph retrieval. Our implementation incorporates the QAP ranking algorithm with addition of a number of surface features (cue words and XML markup). With this baseline system, we obtain an accuracy-at-10 of 57.0% with an MRR of 0.31. Both the baseline and the proposed evaluation method are good starting points for the cur...

متن کامل

Boosting Passage Retrieval through Reuse in Question Answering

Question Answering (QA) is an emerging important field in Information Retrieval. In a QA system the archive of previous questions asked from the system makes a collection full of useful factual nuggets. This paper makes an initial attempt to investigate the reuse of facts contained in the archive of previous questions to help and gain performance in answering future related factoid questions. I...

متن کامل

Multi - Paragraph Segmentation of ExpositoryTextsMarti

We present a method for partitioning expository texts into coherent multi-paragraph units which reeect the subtopic structure of the texts. Using Chafe's Flow Model of discourse, we observe that subtopics are often expressed by the interaction of multiple simultaneous themes. We describe two fully-implemented algorithms that use only term repetition information to determine the extents of the s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017